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CELLS 

(57) Abstract 

A vector for the integration of a gene into the genetic material of a mammalian host cell such that the gene may be ex- 
pressed by the host cell. The vector comprises a promoter and the gene and an immunoglobulin dominant control region derived 
from the mouse X immunoglobulin gene locus capable of eliciting host cell-type restricted, integration she independent, copy 
number dependent expression of the said gene. The DNasel super hypersensitive she exemplified are i) about 2 35 kb upstream 
of the CAP she of the rearranged X x gene, ii) about 2.5 kb upstream of the genomic segment or iii) about 30 kb down- 
stream of the rearranged X 1 gene. Mammalian host cells transformed with the vector are disclosed as are transgenic mam- 
mals transformed with the vector and a method of producing a polypeptide comprising culturing a transformed mammalian 
cell. A method of gene therapy comprising the steps of i) removing stem cells from the body of a mammal,.") optionally kill- 
ing stem cells remaining in the body, iii) transforming the removed stem cells with the vector containing a gene deficient or 
absent in the body, and iv) replacing the transformed stem cells in the body is also disclosed. Alflo disclosed is functional 
mouse immunoglobulin %i enhancer consisting of a DNA sequence comprising all or a functional part of the DNA se- 
quence between the EcoRI site 3.8 kb downstream of the Xho I site in the rearranged mouse X x gene and the SnaBI site 10 kb 
downstream of this Xho I site. The functional mouse immunoglobulin X x enhancer may comprise all or a fractional part of i) 
the 1 3 kb first HindJII to Hindlll DNA fragment downstream of the EcoRI site 3.8 kb downstream of the Xho I site in the rear- 
ranged mouse %i gene, ii) the 3.3 kb Hindlll to Hindlll DNA fragment downstream of the EcoRI site 3.8 kb downstream of the 
Xho I site in the rearranged mouse X\ gene and spanning the SnaBI site 10 kb downstream of this Xho I site. 
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Vector for integration site independent gene expression 
in mammalian host cells 

Field of the Invention 

5 The present invention relates to recombinant DNA technology 
and in particular to a vector useful for transfecting 
mammalian cells in vivo and in vj.tr o to obtain expression of 
a desired structural gene. The invention relates also to 
the use of such vectors in gene therapy and heterologous 
10 gene expression. 

Background to the Tnvention 

There is a continuing need for improved expression vectors 
15 exhibiting high levels of expression. In particular, 
expression vectors for use in mammalian cell lines are of 
increasing importance both for the industrial production of 
desired polypeptides and for the development of therapies 
for genetic disorders. 

20' 

There are many known examples of characterised structural 
genes, which together with appropriate control sequences may 
be inserted into suitable vectors and used to transform host 
cells. A significant problem with the integration of such 
25 a structural gene and control regions into the genome of a 
mammalian cell is that expression has been shown to be 
highly dependent upon the position of the inserted sequence 
in the genome. This results in a wide variation in the 
expression level and only very rarely in a high expression 
30 level. The problem of integration site dependence is solved 
by the present invention and arises from the discovery of 
specific sequences referred to herein as dominant control 
regions (DCRs) derived from immunoglobulin genes which have 
the property of conferring a cell-type restricted, 
35 integration site independent, copy number dependent 
expression characteristic on a linked gene system. 

Two mammalian gene systems have been shown to possess DCRs, 
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namely the JJ-like globin genes (Grosveld, F.G. g£ al, Cell, 
51, (1987), 975; International patent application PCT/GB 
88/00655), and the human CD2 T-cell marker gene (Lang, 6., 
et al, EMBO J, Z, (1988), 1675; International patent 
5 application PCT/GB 88/00655) . 

When a mammalian gene such as 0-globin containing all the 
usual control regions is introduced into transgenic mice, 
the gene is not expressed at the same level as the mouse /?- 

10 globin gene and exhibits integration site position effects. 
This is characterized by a highly variable expression of the 
transgene that is not correlated with the copy number of the 
injected gene in the mouse genome. The same phenomenon has 
been observed in almost all the genes that have been studied 

15 in transgenic mice (Palm iter et al . Ann. Rev. Genet., 
(1986) , £0, 465-499) . Moreover, the level of expression of 
each injected gene in the case of 0 -globin is, at best, an 
order of magnitude below that of the endogenous mouse gene 
(Magram et_al. , Nature, (1985) , US, 338-340; Tovnes et al . , 

20 EMBO J., (1985), 1 1715-1723; Kollias et al . Cell, (1986), 
46 . 89-94) • A similar problem is observed when the /9-globin 
or other genes are introduced into cultured cells by 
transf ection or retroviral infection. This poses a big 
problem when considering gene therapy by gene addition in 

25 stem cells. It is also a major problem for the expression 
of recombinant DNA products in cultured cells. Extensive 
screening for highly producing clones is necessary to 
identify cell-lines in which the vector is optimally 
expressed and selection for vector amplification or use of 

30 multicopy viral vectors is generally required to achieve 
expression levels comparable to those of the naturally 
occurring genes, such as for example 0-globin genes in 
erythroleukaemic cell-lines. 

35 In a chromosome, the genetic material is packaged into a 
DNA/protein complex called chromatin, one effect of which 
may be to limit the availability of DNA for functional 
purposes. It has been established that many gene systems 
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(including the 0-globin system) possess so-called DNasel 
hypersensitive sites. Such sites represent putative 
regulatory regions, where the normal chromatin structure is 
altered, for instance by interaction with regulatory 
5 proteins or to allow such interaction. 

Regions flanking the /3-like globin gene locus which contain 
a number of "super" hypersensitive sites have been 
identified. These sites are more sensitive to DNase I 

10 digestion in nuclei than the sites found in and around the 
individual genes when they are expressed (Tuan PNAS 
USA, (1985), 32, 6384-6388; Groudlne et al. PNAS USA, 
(1983), 80f 7551-7555). In addition, they are erythroid 
cell specific and they are present when any one of the 

15 globin genes is expressed. iJ , 

Tuan et al describe the broad mapping of four major DNase I 
hypersensitive sites in the 5' boundary area of the "0-like" 
globin gene. The authors note that certain sequence 
20 features of these sites are also found in many 
transcriptional enhancers and suggest that the sites, might 
also possess enhancer functions and be recognised by 
erythroid specific cellular factors. 

25 It has been discovered that the complete 0-globin gene with 
intact 5* and 3* boundary regions does not exhibit an 
integration site position dependence (see copending 
International patent application PCT/GB 88/00655). The 
regions of the locus responsible for this significant 

30 characteristic have been determined and shown to be 
associated with DNase I super hypersensitive sites. These 
dominant control regions are quite distinct from enhancers, 
exhibiting properties such as integration site independence 
not exhibited by the known enhancers. The dominant control 

35 region used in conjunction with the known promoter /enhancer 
elements reconstitute the full transcription rate of the 
natural gene. 
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Immunoglobulin genes have been extensively studied in order 
to identify sequences regulating gene expression. An 
immunoglobulin molecule consists of two identical heavy 
polypeptide chains and two identical light polypeptide 
5 chains. The light chains may be either of the k or X type. 
The genes encoding the heavy chain, the #c light chain and 
the X light chain are each located on separate chromosomes 
in the mouse and man. 

10 Unlike most genes which are transcribed from continuous 
genomic DNA sequences, immunoglobulin genes are assembled 
from gene segments which may be widely separated in the germ 
line . 

15 Functionally, heavy chain genes are formed by recombination 
of three genomic segments encoding the variable (V), 
diversity (D) and joining (J) /constant (C) regions of the 
molecule (Figure 1) • Functional light chain genes are 
formed by joining of two segments, one encoding the V region 

20 and the other the J/C region. Both the heavy chain and * 
light chain loci contain many V gene segments (estimates 
vary between 100s and 1000s) estimated to span well over 
lOOOkb (Figure 1). The X locus is, by contrast, much 
smaller and has recently been shown to span approximately 

25 300 kb on chromosome 16 in the mouse. It consists of four 
joining/ constant region gene segments and two variable gene 
segments (Figure 1) . Recombination resulting in functional 
genes occurs predominantly between V, and either J 1 /C 1 or 
Jj/Cj elements or between V 2 and J 2 /C 2 elements (J</C 4 is a 

30 pseudogene) although recombinations between V 2 and J3/C3 or 
Jj/Cj are seen very rarely. 

Control of transcription of both rearranged heavy and k 
light chain genes depends both on the activity of a tissue 
35 specific promoter upstream of the V region (Mason, J.O. §£. 
al, Cell, (1985), 11, 479; Bergman, Y. £fc air PNAS USA, 
(1984), £1, 7041) and a. tissue specific enhancer located in 
the J-C intron (Gillies, S.D. e£ air Cell, (1983), 22/ 717; 
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Banerji, J. g£ al, Cell, (1983) , 31, 729; Plcard, D. et al f 
Nature, 307 . 80) . These elements act synergist ically 
(Garcia, J.V. e£ al, Nature, (1986), 322 . 383). Recently a 
second B-cell specific enhancer has been identified in the 
5 k light chain locus (Meyer, K.B. et al, EMBO J., (1989), £, 
No. 7,1959-1964). This further enhancer is located 9 kb 
downstream of C ( . 

More recently Bich-Thuy and Queen (1989 NAR 17:5307) 
10 described an enhancer activity immediately downstream of the 
rearranged A 1 gene. Sequences downstream of the A t gene 
increased expression of a chloramphenicol acetyl transferase 
(CAT) reporter gene linked to the A, promoter in a myeloma 
cell line which made lambda light chains ( J558L) but not in 
15 two myeloma cell lines which make * light chains. 

This enhancer activity differs in several ways from that of 
the heavy chain and « light chain enhancers. Firstly, it 
displayed a marked orientation preference. Secondly, it 

20 consists of several segments which can independently 
stimulate transcription and which are spread over about 4 
kb of DNA immediately downstream of the A, coding sequence. 
Thirdly, it is apparently only expressed in A chain 
producing myeloma cells and not in cells producing « light 

25 chains. 

Spandidos and Anderson (1984 FEBS Lett. 175: 152) describe 
an 8kb fragment (containing 2 constant region gene segments) 
from the human A locus which increases the number of 6418 

30 resistant colonies obtained after transfection of myeloma 
cells with a plasmid containing the 8kb fragment linked to 
the aminoglycoside-phosphotransferase (aph) gene under the 
control of an e-globin promoter. Levels of aph specific 
mRNA were reported to be increased following transient 

35 transfection of this construct when compared with a 
construct lacking the 8kb sequence. However, no controls 
were presented for transfection efficiency in the transient 
assays and no further reports relating to enhancer activity 
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in the human x locus have been published. 

DNA fragments carrying rearranged heavy or k light chain 
genes are expressed when transf ected into lymphoid cells 
5 although generally at least 10 times less efficiently than 
endogenous immunoglobulin genes (Oi, V.T. e£ si, PNAS USA, 
(1983), fi.fi, 825; Neuberger, M.S., EMBO J., (1983), 2, 1373). 
Expression of a similar DNA fragment containing a rearranged 
Al gene however is not detectable in transf ected lymphoid 
10 cells and expression is only observed when an SV40 enhancer 
is added (Picard, D. et al . Nature, (1984), Ifil, 80;Cone, 
R.D. at al, Science, (1987), 226, 954). This is apparently 
due to the absence of a functional enhancer such as that 
present in the J-C intron of the « gene. 

15 

Although promoter and enhancer elements associated with * 
and heavy chain genes are sufficient to confer lymphoid cell 
specific expression of antibody genes, the level of 
expression is reduced compared with expression of endogenous 

20 antibody genes and genes are subject to position effects 
such that no clear relation exists between the level of 
expression and the copy number of the introduced gene (Oi, 
V.T. fit Al; PNAS USA, (1983), fifi* 825; Neuberger, M.S., EMBO 
J., (1983), Zi 1373). Similar results are obtained when * 

25 and heavy chain genes are introduced into transgenic mice 
(Storb U. , et al, Ann. Rev. Immunol. , 5_, (1987) , 151) . 

Sequences required for full regulation of antibody gene 
expression are therefore lacking from constructs used to 
30 date in expression studies (Grosschedl, R. et al . Cell, 
(1985) , 55, 645) . 

We have now identified super hypersensitive sites in the 
rearranged a 1 mouse immunoglobulin gene outside of the 
35 region previously suggested to be involved in expression of 
the A r gene. The strong implication of our finding is that 
immunoglobulin gene expression shares a common feature with 
0-globin and CD2 expression, namely the existence of 
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dominant control regions. 

Many active genes have hypersensitive sites associated with 
their promoters. For example ,. a tissue specific DNasel 
5 hypersensitive site is observed 300bp upstream from the 
start of the coding region of the rearranged * 
immunoglobulin gene ( Chung , S-Y e£ al. PNAS USA, £Q, 2427) . 
No such site was identified in the rearranged A 1 gene which 
was studied in the present work nor was any site identified 

10 in the J-C intron where hypersensitive sites associated with 
immunoglobulin enhancers have been identified in both the 
heavy chain (Mills, F. et ai., (1984), Nature, 2Q£, 807) and 
k light chain genes (Chung, S-Y et al , PNAS USA, 80, 2427; 
Parslow,T.G. et al . NAR, (1983), H, 4775); Weischet fit al, 

15 NAR, (1982) , Ifl, 3627). 

The hypersensitive sites characteristic of the dominant 
control regions of the 0-globin locus identified by Grosveld 
efc al (Grosveld, F.G. et al . Cell, (1987) , 51, 957-985) are 

20 referred to as "super hypersensitive 19 • The designation 
"super" hypersensitive refers to the fact that these* sites 
are much more sensitive to DNas6I that the "normal" 
hypersensitive site associated with the /3-globin promoter 
(Groudine, M. et al# PNAS USA, (1983), fifl, 7551). In the 

25 mouse X locus, promoter hypersensitive sites are not 
detectable and the only DNasel hypersensitive sites which 
are seen, map at larger distances from the gene. 

The hypersensitive sites in the mouse X locus indicate 
30 locations of sequences which, according to the present 
invention confer dominant control of immunoglobulin genes. 
On the basis of this mapping, DNA sequences flanking the 
rearranged gene in J558L have been cloned. 

35 The present invention is applicable to the production of 
transgenic animals and the techniques for producing such are 
now widely known. For a review, see Jaenisch, Science, 
(1988), 2±0, 1468-1474. 
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The present invention provides a solution to the problem of 
integration site dependence of expression making possible 
the insertion of functionally active gene systems into 
5 mammalian genomes both in vitro and ln_yiyo. Specifically, 
the present invention provides a vector capable of 
expressing homologous and heterologous genes after their 
introduction into immunoglobulin producing cells, whether In 
vivo or in vitro , in an integration-site independent cell 
10 specific manner. 

We have found also that certain of the hypersensitive sites 
in the mouse X locus correspond to new useful x 
immunoglobulin enhancer elements. Thus the present 
15 invention further provides these new enhancer elements and 
vectors containing these elements, useful for the expression 
of heterologous genes. 

Summary of the Invention 

20 

According to a first aspect of the present invention, there 
is provided a vector for the integration of a gene into the 
genetic material of a mammalian host cell such that the gene 
may be expressed by the host cell, the vector comprising a 
25 promoter and the said gene, characterised in that the vector 
includes an immunoglobulin gene dominant control region 
capable of eliciting host cell-type restricted, integration 
site independent, copy number dependent expression of the 
said gene. 

30 

As used herein the term "dominant control region" means a 
seguence of DNA capable of conferring upon a linked gene 
expression system the property of host cell-type restricted, 
integration site independent, copy number dependent 
35 expression when integrated into the genome of a host cell 
compatible with the dominant control region. Such a 
dominant control region retains this property when fully 
reconstituted within the chromosome of the host cell. The 
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ability to direct efficient host cell-type restricted 
expression is retained even when fully reformed in a 
heterologous chromosomal position. 

5 It is hypothesised that the dominant control regions of the 
invention may open the chromatin structure of the DNA, 
making it more accessible and thus may act as a locus 
organiser . 

10 The dominant control region may be a single contiguous 
sequence corresponding to, or derived from a naturally 
occurring gene system or may consist of two or more such 
sequences linked together with or without intervening 
polynucleotides . 

15 

The dominant control region may be derived by recombinant 
DNA techniques from a naturally occurring gene system or may 
correspond to a naturally occurring gene system in the sense 
of being manufactured using known techniques of 
20 polynucleotide synthesis from sequence data relating to a 
naturally occurring gene system. Alterations of the 
sequence may be made which do not alter the function of the 
dominant control region. 

25 The dominant control region consists of, includes, is 
derived from, corresponds to, or is associated with, one or 
more DNase I super hypersensitive sites of an immunoglobulin 
gene. 

30 The dominant control region may comprise, for example, one 
or more of the DNase I super hypersensitive sites 
specifically identified herein. 

Other sequences might, however, contribute to or exhibit 
35 the functional characteristics of a dominant control region. 

A number of DNase super hypersensitive sites have been 
identified: 
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about 2 . 4kb upstream of the CAP site of the rearranged 
X] gene, 

about 2.5kb upstream of the genomic VA 2 segment, and 

additional sites located 3 1 to C\,, and/or C\ A as 
follows: 

-r about 17 kb 3' of CA 4 

- between 28 and 48 kb (i.e about 30kb) 3' of CA, 

Any of the above-identified super hypersensitive sites may, 
on their own, constitute a dominant control region or may 
form part of a dominant control region. 

The methods used to identify these super hypersensitive 
sites may be used to identify functional dominant control 
regions. 

Alternatively functional expression assay methods may be 
used to identify functional dominant control regions. For 
example, expression assays using an appropriate reporter 
gene, such as an immunoglobulin gene may be used, wherein 
dominant control regions are identified by their capability 
of eliciting host cell-type restricted, integration site 
independent, copy number dependent expression of the 
reporter gene. 

Where the naturally occurring dominant control region 
30 comprises two or more subsequences separated by an 
intervening polynucleotide sequence or sequences, the 
dominant control region may comprise two or more of the 
subsequences linked in the absence of all or a part of one 
or more of the intervening sequences. Thus, if the 
35 immunoglobulin dominant control region comprises two or more 
discrete subsequences separated by intervening non- 
functional sequences, (for example, two or more super 
hypersensitive sites) the vector of the invention may 
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comprise a dominant control region comprising two or more of 
the subsequences linked together with all or part of the 
intervening sequences removed. 

5 one or more super hypersensitive sites may alone or 
collectively constitute a useful enhancer element derived 
from the mouse \ immunoglobulin locus. 

It will be understood that a super hypersensitive site or a 
10 group of such sites which does not exhibit integration site 
independence is to be considered as an enhancer not a 
dominant control region. 

Enhancers are sequences which activate transcription. They 
15 may be located upstream (for example, the SV40 early region 
enhancer, Banerji e£ al 1981 Cell 22: 299), downstream (for 
example the 0-globin enhancer, Choi and Engel 1986 Nature 
323:731), or within (for example the immunoglobulin heavy 
chain enhancer, Gillis efc &1 1983 Cell 22: 717, Banerji e£ 
20 al 1983 Cell 32: 729) the transcribed region of the gene. 

Enhancers activate transcription in a manner which is 
relatively independent of their distance from and 
independent of their orientation with respect to the gene 
25 and they are not promoter specific. For example, the 
immunoglobulin heavy chain enhancer can activate 
transcription from a /3-globin promoter (Picard and Shaffner 
1984 Nature 307: 80-82). 

30 Some enhancers are active in a variety of different cell 
types (for example, the enhancers from SV40 and Rous Sarcoma 
Virus (RSV) ) . Other enhancers show cell type specificity 
(for example, the immunoglobulin enhancers are only active 
in lymphoid cells see Maniatis e£ al 1987 Science 236: 1237- 

35 1245 for a review). This cell-type specificity Of enhancer 
activity can be seen in both cultured cells and transgenic 
mice (transgenic mice reviewed by Palmiter and Brinster 1986 
Ann. Rev. Genet. 2Q. 465-499) . 
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According to a second aspect of the present invention, there 
is provided a functional enhancer element derived from a 
mouse A immunoglobulin locus. The mouse A immunoglobulin 
5 enhancer element may comprise one or more super 
hypersensitive sites or a separate region within the mouse 
A immunoglobulin locus. 

The second aspect of the invention includes a vector 
10 including an enhancer of the second aspect of the invention. 
In the case of an expression vector the enhancer is in 
functional association with a promoter (suitably an 
immunoglobulin promoter) such that the enhancer activates 
transcription of a gene downstream of the promoter. 

15 

Such an enhancer sequence may contribute to or exhibit the 
functional characteristics of a dominant control region. 

We have identified a region downstream of the mouse CA 1 gene 
20 segment which possesses enhancer activity and may be a 
further dominant control region or a part thereof in 
combination with the super hypersensitive sites of the first 
aspect of the invention. 

25 The region consists of a DNA sequence comprising all or a 
functional part of the DNA sequence between the EcoRI site 
3.8 kb downstream of the Xhol site in the CA 1 gene segment 
of the rearranged A 1 gene and the SnaBI site 10 kb 
downstream of this Xhol site. 

30 

As used herein, the expression "functional" as applied to an 
enhancer connotes a DNA sequence which is capable of 
activating transcription from a promoter in a manner which 
is independent of its distance from or orientation with 
35 respect to the gene and is not promoter specific. 

As used herein, the expression "downstream 11 connotes in the 
direction of transcription. 
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The, mouse immunoglobulin A 1 enhancer may be a single 
contiguous sequence corresponding to, or derived from a 
naturally occurring mouse immunoglobulin A, gene system or 
5 may consist of two or more such sequences linked together 
with or without intervening nucleotides or polynucleotides. 

The mouse immunoglobulin A, enhancer may be derived by 
recombinant DNA techniques from a naturally-occurring gene 

10 system or may correspond to a naturally occurring gene 
system in the sense of being manufactured using known 
techniques of polynucleotide synthesis from sequence data 
relating to a naturally occurring gene system. Alterations 
of the sequence may be made which do not alter the function 

15 of the mouse immunoglobulin A, enhancer. 

The sequence between the EcoRI site 3.8 kb downstream of the 
rearranged mouse A 1 gene and the SnaBI site 10 kb downstream 
of the Xhol site in the rearranged mouse A, gene contains or 
20 overlaps two Hindlll fragments both of which independently 
enhance expression of the A, gene in SP2/0 cells. 

The enhancer of the invention may comprise all or a 
functional part of the 1.3 kb first Hindlll to Hindlll DNA 
25 fragment downstream of the EcoRI site 3.8 kb downstream of 
the Xhol site in the rearranged mouse A, gene.' 

With reference to Figure 11, the said 1.3 kb Hindlll to 
Hindlll fragment downstream of the EcoRI site 3.5 kb 
30 downstream of the rearranged mouse A, gene is that labelled 
"HIIIA" 

The enhancer of the invention may comprise all or a 
functional part of the 3.3 kb Hindlll to Hindlll DNA 
35 fragment downstream of the EcoRI site 3.8 kb downstream of 
the Xhol site in the rearranged mouse A 1 gene and spanning 
the SnaBI site 10 kb downstream of the Xhol site in the 
rearranged mouse A 1 gene. 
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With reference to Figure 11, the said 3,3 kb Hindlll to 
Hindlll fragment downstream of the EcoRI site 3.8 kb 
downstream of the Xhol site in the rearranged mouse A 1 gene 
5 and spanning the SnaBI site 10 kb downstream of the Xhol 
site in the rearranged mouse A 1 gene is that labelled 
"HIIIC" . 

The term "vector" as used herein connotes in its broadest 
10 sense any recombinant DNA material capable of transferring 
DNA from one cell to another. 

The vector may be a single piece of DNA in linear or 
circular form and may, in addition to the essential 

15 functional elements of the invention, include such other 
sequences as are necessary for particular applications. 
For example, the vector may contain additional features such 
as a selectable marker gene or genes, and/or features which 
assist translation or other aspects of the production of a 

20 cloned product. 

A vector suitable for integration consists of an isolated 
DNA sequence comprising an immunoglobulin dominant control 
region or enhancer and an independent structural gene 

25 expression system. The DNA sequence is not linked at either 
end to other substantial DNA sequences. The isolated DNA 
sequence may however be provided with linkers for ligation 
into a vector for replication purposes or may be provided 
with sequences at one or both ends to assist integration 

30 into a genome. 

The vector including a dominant control region defines an 
independent locus which can express homologous and 
heterologous genes after integration into a mammalian host 
35 cell normally capable of immunoglobulin expression, in an 
integration site independent, copy number dependent manner. 

The invention also provides a transfer vector, suitably in 
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the form of a plasmid, comprising an immunoglobulin dominant 
control region or an enhancer of the second aspect of the 
invention. 

5 The transfer vector may comprise an isolated functional 
mouse immunoglobulin A enhancer selected from: 

i) the sequence between the EcoRI site 3,8 kb downstream 
of the ( Xhol site in the rearranged mouse A t gene and the 

10 SnaBI site 10 kb downstream of the Xhol site in the 
rearranged mouse A 1 gene, 

ii) all or a functional part of the 1.3 kb first Hindlll 
to Hindlll DNA fragment downstream of the EcoRI site 3.8 kb 

15 downstream of the Xhol site in the rearranged mouse A 1 gene 
(Figure 11: that labelled "HIIIA") , , 

iii) all or a functional part of the 3.3 kb Hindlll to 
Hindlll DNA fragment downstream of the EcoRI site 3.8 kb 

20 downstream of the Xhol site in the rearranged mouse A 1 gene 
and spanning the SnaBI site 10 kb downstream of the Xhol 
site in the rearranged mouse A, gene (Figure 11: that 
labelled "HIIIC") , 

25 The term "gene" as used herein connotes a DNA sequence/ 
preferably a structural gene encoding a polypeptide. The 
polypeptide may be a commercially useful polypeptide , such 
as a pharmaceutical, e.g. a plasminogen activator, 
lymphokine or potentiating or stimulating factor, or an 

3 0 immunoglobulin polypeptide, e.g. a chimeric, humanized or 
altered immunoglobulin polypeptide. Th$ polypeptide may be 
entirely heterologous to the host cell. Alternatively, the 
gene may encode a polypeptide which is deficient absent or 
mutated in the host cell. 

35 

The mammalian host cell may be any mammalian host cell 
susceptible to uptake of the vector of the invention and 
capable of immunoglobulin gene expression. The vector DNA 
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may be transferred, to the mammalian host cell be 
transfection, infection, microinjection, cell fusion, or 
protoplast fusion. 

5 The host cell may be a cell of a living human or an animal. 
In particular, the host cell, may be a cell of a transgenic 
animal such as a mouse. The host cell may be derived from 
tissue in which the dominant control region is functional, 
such as B-cells or precursors thereof (where the dominant 
10 control region may not be functional), such as stem cells. 
The cell may be a lymphoid cell such as a myeloma or 
hybridoma. 

The promoter may be any promoter capable of functioning in 
15 the host cell and may be for example a mammalian or viral 
promoter. Optionally, the promoter may be homologous with 
the gene locus of the dominant control region and may be 
present in tandem with another promoter such as a viral tk 
or other viral promoter and may include one or more enhancer 
20 elements. Examples of suitable viral promoters include the 
SV40 late, Mo MLV LTR, RSV LTR, and hCMV MIE promoters. 

The immunoglobulin dominant control region may be from the 
A, k or heavy chain locus or from other genes specifically 
25 expressed in B-cells. 

In a further aspect of the invention, there is provided a 
method of producing a polypeptide comprising culturing a 
host cell transformed with a vector of the invention. 

30 

The method may be applied in vitro to produce a desired 
polypeptide. In addition, the method may be applied in vivo 
to produce a polypeptide having no therapeutic value to the 
animal. Such a method of producing a polypeptide is not to 
35 be considered as a method of treating the human or animal" 
body. 

In a further aspect of the invention the vector may be used 
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in a method of treatment of the human or animal body by 
replacing or supplementing a defective mammalian gene. 

Many diseases of the human or animal body result from 
5 deficiencies in the production of certain gene products* 
The characterising features of the vectors of this invention 
make them amply suited to the treatment of deficiencies by 
gene therapy in vivo . 

10 A method of gene therapy is provided comprising removing 
stem cells from the body of a human or ah animal , optionally 
killing stem cells remaining in the body, transforming the 
removed stem cells with a vector of the invention containing 
a gene deficient, or absent, in the human or animal body, 

15 and replacing the transformed stem cells in the human or 
animal body. 

This method can be used to replace or supplement a gene 
deficient in a human or animal. For example, an individual 

20 suffering from adenosine deaminase deficiency, severe 
combined immune deficiency (SCID) or hgprt deficiency. Thus 
the method may be used specifically in the treatment of B- 
cell genetic defects but also in the treatment of general 
metabolic genetic disorders wherein the cell source of the 

25 gene product is not critical. 

The invention is now described by way of example with 
reference to the accompanying drawings. 

30 Brief Description of the Drawings 

Fig 1 shows the structure of the mouse immunoglobulin X , * 
and heavy chain locii. 

35 Fig 2 shows the rearrangements on each mouse chromosome 16 
of the A locus in myeloma cell-line J558L. 

Fig 3 shows the rearranged mouse A 1 gene in the myeloma 
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cell-line J558L. 

Fig 4 shows DNasel fadeouts on myeloma cell-line J558L 
nuclei cut with a) ApaLI (A) and with ApaLI and Xhol 
5 (B) , probed with a CA 1 probe and b) Sphl(A) and with 

SphI and XhoI(B) probed with a VA 1 probe. 

Fig 5 shows DNasel fadeouts on nuclei from J558L and MEL 
nuclei each digested with Kpnl and probed with a VA t 
10 probe. 

Fig 6 shows a mapping of the DNasel hypersensitive sites 
identified in the J558L genome (3* hypersensitive 
sites are indicated in parenthesis) . 

15 

Fig 7 shows restriction maps of the pLTC cosmid cloning 
vector (a) and of fragments of J558L DNA cloned at 
the BamHI site of pLTC. (b) The position of the 7.5 
kb EcoRI fragment carrying the A 1 gene is shown (all 

20 other EcoRI sites have been omitted for clarity) . All 

ApaLI sites mapping 3 1 of the A 1 gene in cos 4N and 
cos 3 1 . 3 are shown whereas in cos 2B only the ApaLI 
site 5 1 of the A 1 gene is shown. ApaLI sites lying 
further upstream of this site have not yet been 

25 mapped. 

Fig 8 shows the results of SI protection analysis on RNA 
from various constructs in SP2/0 using a A 1 mRNA- 
specific probe (Lane 1 - 1/xg J558L RNA: Lane 2 - lOjzg 
30 SP2/0 RNA: Lane 3 - 10/xg SP2/0-cos2B RNA: Lane 4 - 

lOjig SP2/0-COS4N RNA: Lane 5 - lOjxg SP2/0-pLTCA 1 RNA: 
Lane 6 - 10/xg SP2/0-pSVneoA 1 RNA) , 

Fig 9 shows the restriction maps of fragments of cos4N 
35 prepared on salt gradients and used to transf ect 

SP2/0, 

Fig 10 shows the results of SI protection analysis on RNA 
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from the fragments shown in Figure 9 in SP2/0 using 
a A 1 mRNA-specific probe ( Lane 1 - 10/xg SP2/6 RNA: 
Lane 2 - 1/xg J558L RNA: Lanes 3 and 4 - 10/xg SP2/0- 
cos4N RNA: Lanes 5 to 9 - 10/xg SP2/0-cos4N ApaLI 24.5 
5 kb fragment RNA: Lane 10 - pBR322 Hinfl markers: Lanes 

11 to 14 - 10/xg SP2/0- cos4N ApaLJ/SnaBJ19kb fragment 
RNA), 

Fig 11 shows the restriction map of the enhancer region of 
10 the mouse A 1 gene including the two Hindlll fragments 

• thereof which exhibit enhancer activity , 

Fig 12 shows the restriction maps of the plasmid, pLTCAj, 
used to test putative enhancer fragments for 
15 transcription activation by stable transf ection of 

SP2/0. 

Fig 13 shows the results of si protection analysis on RNA 
from the vectors shown generally in Figure 12 (a) in 

20 SP2/0 using a A t mRNA-specific probe ( Lane 1 - 10/xg 

SP2/0 RNA: Lane 2 - 1/xg J558L RNA: Lane 3 - 10/xg 
SP2/0-cos4N RNA: Lanes 4 and 5 - SP2/0 pLTCAlHIIIAa 
10/xg RNA: Lanes 6 and 7 - SP2/0 pLTCAj HIIIA/3 10/xg 
RNA: Lanes 8 and 9 - SP2/0 pLTCA 1 10/xg RNA: Lanes 10 

25 and 11 - SP2/0 pLTCA, HHICa 10/xg RNA: Lanes 12 and 13 

- SP2/0 pLTCA, HIIIC0 10/xg RNA). 

Fig 14 shows SI nuclease protection analysis of mRNA from 
tissues of transgenic (36, 37 and 38) and non- 
30 transgenic (39 and 40) littermates of the mouse line 
2071. 

T = thymus 
S = spleen 
35 L = liver 

B = brain 



Each track contains 10/xg of RNA hybridised with a 5 ' 
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cDNA probe and a 5« /3 actin cDNA probe. 1 /xg of 
J558L RNA was hybridised with the same probes as a 
positive control. 

5 The position of migration of the protected fragments 

is indicated by the arrow. 

Detailed Description of Embodiments 

10 1. Identification of Hypersensitive Sites in J558L Nuclei. 

Hypersensitive sites (HSSs) associated with dominant control 
regions (DCRs) were sought at large distances from the 
immunoglobulin locus. Southern blots were carried out on 

15 myeloma cell-line J558L DNA cut with a variety of 6 base- 
cutter restriction endonucleases to identify enzymes which 
generate large fragments carrying the rearranged A t gene. 
Two such enzymes were ApaLI and SphI which generated 35kb 
and 50kb fragments carrying the A 1 gene respectively. 

20 DNasel fadeouts were performed on J558L nuclei and DNA 
extracted from these nuclei was digested with ApaLI or SphI 
and the fragment pattern was analysed by Southern blotting 
using a variety of probes. The DNasel fadeouts were carried 
out as described by Enver et al (Enver, T. et al . Nature, 

25 (1985) , 318 , 680-683) , with the modification described by 
Grosveld et al (Grosveld F.G. et al . Cell, (1987), 51, 975- 
985) . DNA was prepared from DNasel-treated nuclei using 
the procedure of Luster et al (Luster et al . Mol. Cell 
Biol., 7, 3723) . 

30 

The probes used were: 

1. a probe from the VA 1 region (0.8kb Xbal fragment) which 
hybridises to both VA, and VA 2 which are highly conserved, 

35 

2. a probe from the C region of A, (lkb BamHI-Kpnl) which 
hybridises to CA 1 and CA A , and 
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3. a probe from the 3 f region of CA, (2kb KpnI-Xbal) which 
is a unique sequence and recognises only CA^associated 
sequences (see Figure 3). 

5 Probing ApaLI or SphI blots with each of these three probes 
yielded a number of hypersensitive subbands (Figure 4a and 
4b) . In order to map these sites more closely , the ApaLI or 
SphI digested samples were also digested with Xhol (which 
cuts in the CA 1 sequence) • A 4.3kb and a 3kb band were seen 

10 on Sphl/Xhol and ApaLI/XhoI blots when probed with the V 
region probe. The 4.3kb band was also seen with CA 1 probe, 
showing that the A t gene contains a hypersensitive site 
2.4kb upstream of the CAP site. The 3kb: subband maps 2.5kb 
upstream of the genomic VA 2 segment. These positions have 

15 been confirmed by mapping with respect to a number of other . 
enzymes (not shown) . 

In addition to these mapped sites, we have detected a number 
of other hypersensitive sites using the V, C and 3 1 probe 
20 (Figure 4a and 4b). All of these are associated with the A 
locus but cannot be unequivocally assigned to particular 
positions. What is clear is that some of these are located 
to the 3 1 side of the various constant regions. 

25 We therefore decided to clone sequences flanking the A genes 
to isolate the sequences which contain these hypersensitive 
sites, starting with the A 1 gene. 

DNasel fadeouts were performed on nuclei from MEL cells as 
30 a control. These were digested with Kpnl and probed with a 
VA 1 probe. Whereas hypersensitive bands (2.6kb) can be seen 
on the J558L Kpnl blots none are seen on the MEL blots 
(Figure 5) . 

35 2. Construction of a J558L Cosmid Library 

In order to clone large fragments flanking the rearranged A 
gene, a cosmid library was prepared from DNA prepared from 
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J558L cells. A partial Mbol digest of J558L DNA was cloned 
at the BamHI site of the vector pLTC which is identical to 
pTCF (Grosveld, F.G. et al M NAR, (1982), lfl, 6715) except 
that the Hpal site has been replaced by an Xhol site which 
5 was used in place of the Hpal site for the preparation of 
vector arms. 

The cosmid library was prepared using the procedures 
described by Grosveld et al (Grosveld, F.G. sfe &I t NAR, 

10 (1982) , 10, 6715) . Stratagene Gigapack Gold packaging 
extracts were used according to the manufacturer's 
instructions and the packaged cosmids were introduced into 
E coli MC1061 rec A. The library was plated onto four 20 x 
20 cm NEN Genascreen Plus hybridisation membranes and 

15 screened as described by Grosveld et al (Gene, (1981), 13, 
227) . Hybridisation was carried out using conditions 
recommended by NEN. 

The library (which consisted of 4 x 10 6 individual clones) 
20 was screened with the CA 1 (lkb BamHI - Kpnl) probe and 
hybridising colonies were purified. The resulting cosmids 
were analysed by Southern blotting using VA t and 3 1 probes 
to establish the length of 5' and 3 f flanking sequences. 
Approximately 20kb of 5» flanking and 20kb and 3 1 flanking 
25 sequence was isolated in this way. Cos 2B contains the 
7.5kb EcoRI fragment containing the A t gene and 
approximately 20kb of 5 1 flanking sequence and cos 4N 
contains the 7.5kb EcoRI fragment with approximately 20kb of 
3 1 flanking sequence (Figure 7) including the ApaLI site 3 1 
30 of the A 1 gene. 

The library was further screened with a V region probe 
(0.5KB SstI VA 2 region segment) in order to clone sequences 
flanking the 5' part of the mouse A locus which may be 
35 important in regulating expression. 

The library was also screened with a probe from the 3 1 end 
of cos4N (an 0.5kb ApaLI/Sall fragment) in order to clone 
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sequences further 3' of the mouse lambda lofcus. Cos 3«.3 
was identified with this probe. Cos 3'. 3 contains 
approximately 14 kilobases of sequence overlapping the 3 • 
region of cos4N together with a further 19kb of 3 • sequence . 
5 A map of cos 3'. 3 is shown in Figure 7. 

3. Functional tests 

3.1 Expression in myelo ma cells. 

10 

Cosmids containing the A 1 gene and flanking sequences are 
tested directly by transfection into myeloma cell-lines* 
The cosmid vector carries a tk-neo gene allowing selection 
of clones in G418. These are then analysed for expression 

15 of the A 1 gene by SI nuclease analysis (for A, mRNA) and 
ELISA (for A, protein). The cosmids are trans fected into 
the cell lines Sp2/0.Agl4 (Shulman et a_l, Nature, (1978) , 
276 . 269) , a mutant hybridoma cell line which produces no 
antibody, and MOPC 315.26 a heavy chain loss mutant (Hozumi 

20 e£ a2, J. Immunol., (1982), 121, 260) which makes A 2 light 
chains. 

Sp2/0.Agl4 cells were transfected by electroporation using 
a Bio-Rad Gene Pulser . 5/zg of linear plasmid DNA was added 
25 to 10 7 cells in PBS and the cells were shocked at 250V at a 
capacitance setting of 960mF. After the shock, the cells 
were incubated at room temperature for' ten minutes before 
adding medium (otMEM, 10% Foetal Calf Serum) . G418 (lmg/ml) 
was added 24 hours after the shock. 

30 

SI mapping was carried out as described by Antoniou et al 
(Antoniou, M et al . In: Human Genetic Diseases - A Practical 
Approach, Ed. Davies, K.E., IRL Press, Oxford). 

35 A, antibody was measured by ELISA using a goat anti-mouse A 
light chain specific antibody preparation and a biotinylated 
derivative of the same (Southern Biotechnology) . The assay 
was developed using horseradish peroxidase coupled to avidin 
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and 2,2' azino bis (3-ethylbenzthiazoline-6~sulphonic acid) 
as substrate. 

After testing the effect of 5' flanking sequences (cos II 
5 and 2B) and 3' sequences alone (cos 4 J and 4N) , constructs 
containing both 5 ' and 3 1 flanking sequences are prepared by 
ligation at the Xhol site in the constant region of the A, 
gene (Figure 3) . 

10 The levels of expression of all the cosmid constructs can 
be compared with the level of expression of the 7.5kb EcoRI 
fragment containing the A 1 gene cloned in the pLTC and with 
the same fragment cloned in the vector pSVneo which contains 
an SV40 enhancer. The cell line J558L serves as a positive 
15 control for full endogenous expression of A 1 mRNA and. 
protein. 

pLTC A 1 and pSVneoA 1 (which both contain the 7.5kb EcoRI 
fragment containing the A T gene) were transfected into 
Sp2/0.Agl4 cells and populations of G418 resistant clones 
were analysed for expression of A 1 specific mRNA and 
protein. SI analysis using a 5' probe from a A 1 cDNA cloned 
in pSPT 18 (Pharmacia) (extending from the SphI site in the 
vector to the Xhol site in CA^ and protein analysis by 
ELISA indicated that the populations transfected with pSVneo 
A t expressed 100-fold less mRNA and protein than J558L 
whereas the cells transfected with pLTC A t did not express 
detectable levels of mRNA (Figure 8) or protein (data not 
shown) . This is in agreement with previously reported 
results. (Picard f D. , et al, Nature, (1984) , 307 , 80-82; 
Cone,R.D., et al, Science, (1987), 236 f 954). 

Populations of SP2/0 cells transfected with cos 4N (which 
contains the A 1 gene with 30 kb 3 1 flanking sequence) also 
35 expressed A 1 specific messenger RNA, by SI nuclease analysis 
(Figure 8: Lane 4) and synthesised A 1 protein, by ELISA. 

The level of expression in these populations is similar to, 



WO 90/10077 



25 



PCT/GB90/00286 



or rather higher than, that seen in populations transfected 
with pSV neoA 1 which contains the SV40 enhancer (Figure 8: 
Lane 6) . 

5 No expression of A, specific mRNA or protein could be 
detected in SP2/0 cells transfected with cos 2B (which 
contains the A, gene together with 22 kb of 5' flanking 
sequence: Figure 8-Lane 3). This is in spite of the 
presence in this cosmid of sequences approximately 2.4 kb 5' 
10 of the VA 1 to which a strong hypersensitive site maps in 
nuclei treated with DNase I. 

All populations which were analysed for mRNA were also 
checked for the presence of the transfected gene by 
15 Southern blotting of EcoRI digested genomic DNA using a 
probe from the 3 1 part of the CA 1 region (the 2kb KpnI/Xbal 
in Figure 3) • 

In order to determine how much of the 3 1 sequence in cos 4N 
20 was necessary for expression of the A 1# restriction 
fragments of cos 4N containing the tJc-neo gene, the A t gene 
and various amounts of 3* sequence were prepared i# on salt 
gradients and used to transfect SP2/0 cells. Populations of 
clones resistant to G418 were tested for expression of the 
25 A 1 gene by SI nuclease protection. These fragments are 
shown in Figure 9. Both the 24.5 kb ApaLI fragment and the 
19 kb ApaLI-SnaBI fragment gave populations of SP2/0 cells 
which expressed A 1 specific mRNA (Figure 10: Lanes 5 to 9 
and 11 to 14). 

30 

Hindlll fragments lying between the EcoRI site 3.8 kb 
downstream of the Xhol site in the A r gene and the SnaBI 
site 10 kb downstream of this Xhol site (Figure 11) were 
cloned into the Hlndlll site of pLTCA 1 (Figure 12). These 
35 plasmids were linearised at the single Pvul site and used to 
transfect SP2/0 cells. 

RNA from CT418 resistant populations was tested for the 
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presence of A 1 specific mRNA as before. A plasmid 
containing the 1.3 kb HihdllT fragment, HIIIA, expressed A 1 
mRNA as did plasmids containing the 3.3. kb Hlndlll fragment 
HIIIC (Figure 13: Lanes 4 to 7 and 10 to 14) . 

5 

In conclusion , an enhancer downstream of the mouse CA, gene 
segment has been identified. Enhancer activity lies between 
the EcoRI site 3.8 kb downstream of the Xhol site in the A 1 
gene and the SnaBI site 10 kb downstream of this Xhol site. 
10 This region contains two Hind III fragments (HIIIA 1.3kjS and 
HIIIC 3.3 kb) both of which enhance expression of the A 1 
gene in SP2/0 cells. 

3.2 Transgenic Mice. 

15 

The Sail fragments of cos 4N and cos 3».3 which contain all 
the cloned J558L DNA were purified using salt gradients. 
The cos 4N Sail fragment was injected into fertilised mouse 
eggs (C57BL/10 x CBA) FI, either alone or as a co- injection 

20 with the Sail fragment of cos 3'. 3, in order to generate 
transgenic mice containing the X, gene together with varying 
amounts of 3 f flanking sequence. The expression of the A t 
gene in these mice was monitored both by SI nuclease 
protection analysis of mRNA from lymphoid and non- lymphoid 

25 tissues and by fluorescence activated cell sorter analysis 
of A, protein expression on the surface of spleen cells. 

Two mouse lines were generated which carried both cos 4N and 
cos 3'.3 Sail fragments in their genome. Southern blot 
30 analysis using probes from the ends of the Sail fragments 
indicated that the cos 4N and cos 3 ".3 fragments were linked 
together at the same insertion site in the genome (data not 
shown) . 

35 The founder animals were bred with (C57BL/10 x CBA) FI mice 
and their offspring analysed 5-7 days after birth for 
expression of A 1 . RNA prepared from spleen, liver, thymus 
and brain was analysed by SI nuclease protection using a 
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probe from the 5' end of a cDNA. For analysis of cell 
surface expression of X y , spleen tissue was homogenised 
using a glass homogeniser and red cells were lysed using 
Tris-ammonium chloride. Approximately 10 6 cells were 
5 stained with phycoerythrin conjugated goat anti-mouse A 
(Southern Biotechnology) or fluorescein isothiocyanate 
conjugated goat anti-mouse n (Sigma) . Fluorescence analysis 
was carried out using a Beckman Dickinson FACSTAR-PLUS flow 
cytometer. Transgenic animals were identified by Southern 
10 blotting of tail DNA using probes from both cos 4N and cos 
3'.3. ■ 

For the two mouse lines, 2071 and 3002 , the transgenic 
offspring exhibited an elevation in both the number of 

15 spleen cells expressing lambda chains at the cell surface 
and in the level of A 1 mRNA detected in the spleen compared 
with non-transgenic littermates. Figure 14 shows an SI 
nuclease protection analysis of RNA from various tissues of 
two non-transgenic and three transgenic offspring of line 

20 2071. The three transgenic offspring show an approximately 
5-10 fold increase in the level of A 1 specific RNA in the 
spleen and no detectable expression in thymus , brain or 
liver. Similar results were obtained for mouse line 3002. 
Table 1 shows the results of FACS analysis of spleen cells 

25 from 5 day old offspring of lines 2071 and 3002. In both 
cases the number of IgM positive B cells expressing lambda 
chains was approximately 5 fold higher in the transgenic 
animals compared with their non-transgenic littermates. 
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Table 1 

FACS analysis of spleen cells from transgenic and non- 
transgenic littermates of mouse lines 2071 and 3002. 

5 

% spleen cells staining with 
Mouse Transgene % A 



10 





cos 4N 


cos 3 ' . 3 


PE-GAM X 


FITE-GAM IX 




2071/21 


X 


X 


5.0, 5.0 


15.5, 14.6 


33 


2071/22 


X 


X 


3.7, 3.3 


13.6, 12.7 


27 


2071/23 


X 


X 


1*4, 1.4 


20.9, 19.4 


7 


2071/24 






1.1, 1.0 


14.3, 15.5 


6 


3002/1 


X 


X 


4,57, 4.34 


7.89, 9.96 


50 


3002/2 


X 


X 


3.32, 2.54 


6.56, 6.43 


45 


3002/3 


X 


X 


0.39, 0.52 


6.16, 5.87 


7.5 


3002/4 


X 


X 


0.39, 0.40 


4.33, 5.16 


8.3 


3002/5 


X 


X 


2.87, 2.97 


7.62, 7.64 


38 


3002/6 


X 


X 


2.36, 2.91 


6.43, 6.81 


40 



Six mouse lines carrying various numbers of copies of the 
cos 4N Sail fragment alone were also analysed. No 
25 difference was observed either in the number of X expressing 
B cells or in the level of A, specific mRNA in the spleen 
between transgenic and non-transgenic offspring. 

We concluded from these data that an element which strongly 
30 enhances the expression of the mouse X^ gene resides in the 
sequences towards the 3 • end of cos 3 ' . 3 . The presence of 
DNasel hypersensitive sites in this region was investigated 
by digesting DNA from a DNasel fadeout of J558L nuclei with 
the enzymes Xhol and Pvul and probing Southern blots with a 
35 1.9 kb EcoRV-PvuI fragment from the 3' end of cos 3'. 3. The 
probe hybridises to a 28 kb fragment in J558L DNA. 
Preliminary data suggest that there may be two 
hypersensitive sites mapping approximately 8 kb and 12 kb 
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upstream of the Pvul site. J558L fadeouts were also 
digested with ApaLI and Southern blots probed with a 1 kb 
EcoRI / SnaBI fragment from cos 3 1 . 3 ( see Figure 7b) . 
Preliminary exposures of these blots suggest the appearance 
5 of a DNasel hypersensitive subband approximately 5 kb 
downstream of the most 3 1 ApaLI site in cos 3'. 3. There may 
therefore be 2 or 3 DNasel hypersensitive sites which map to 
the region lying between 28 and 48 kb (i.e. about 30kb) 
downstream of the Xhol site in the rearranged A t gene. 

10 

Both the mouse lines 3002 and 2071 contain between five and 
eight copies of each of the cos 4N and cos 3'.3 Sail 
fragments. Since approximately five times the number of B 
cells are expressing A r chains as in the hon-transgenic 

15 animals and the level of A^ specific RNA is increased, 
between 5 and 10 fold, the level of expression of the 
transgene appears to be approaching the level of expression 
of an endogenous A 1 allele. The generation of more mouse 
lines containing sequences from cos 4N and cos 3 '.3 will 

20 confirm whether expression levels are directly correlated 
with copy number. 

The data are consistent with the sequences from cos 3' .3 to 
which the hypersensitive site maps acting as a DCR. They 
25 are, in any case, acting as a strong, B cell specific 
enhancer. Possibly addition of 5' sequences containing the 
hypersensitive site kb upstream of the CAP site of the 
rearranged A 1 gene to the injected DNA may complete a 
functional DCR. 

30 

A variation on this approach to isolating OCR's involves 
using very large segments of the mouse lambda locus cloned 
in yeast artificial chromosomes (YACS) (Murray and Szostak, 
1983, Nature 305:189-193) . Several hundred kilobases of DNA 
35 can be cloned using YAC vectors. A YAC library prepared 
from J558L DNA would be screened for clones containing 
lambda sequences. These clones would be tested for 
expression of lambda either by transfection into myeloma 



WO 90/10077 



30 



PCT/GB90/00286 



cells or by injection into fertilised mouse eggs. Copy 
number dependent, tissue specific, high level expression 
would be indicative of the presence of a DCR and sequences 
responsible for this could be identified by DNasel 
5 hypersensitive site mapping. These could then be subcloned 
from the YAC clone. 

The present invention identifies, for the first time, 
sequences for dominant control regions associated with the 
10 mouse X immunoglobulin locus. 

In the cell line J558L, which synthesises A t light chains, 
hypersensitive sites have been identified 2.3 - 2.4kb 
upstream of the CAP site of the rearranged A t gene, 2.5kb 
15 upstream of the genomic VA 2 segment approximately 30 kb 3 1 
of the rearranged A 1 gene and approximately 17 kb 3' of the 
CA 4 gene segment. Sequences to which these sites map have 
been cloned from the J558L cell-line and their effect on A t 
gene expression tested in myeloma cells and transgenic mice. 

20 

The DNasel hypersensitive sites described here may represent 
"super" hypersensitive sites since no "normal" 
hypersensitive sites were detected associated with the A 1 
immunoglobulin promoter. It is possible that the DNasel 

25 hypersensitive sites are "super" hypersensitive because they 
comprise a cluster of hypersensitive sites, for example as 
has been shown for the j9-globin locus. .The location of the 
hypersensitive sites is also consistent with the probable 
location of DCRs by analogy with the jS-globin and CD2 loci 

30 (i.e. 5 1 and 3' of the locus). 

It will be understood that the invention is described by way 
of example only and modifications of detail may be made 
within the scope of the invention. 
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CLAIMS 

1. A vector for the integration of a gene into the genetic 
material of a mammalian host cell such that the gene may be 
expressed by the host cell, the vector comprising a promoter 
and the said gene, wherein the vector includes an 
immunoglobulin dominant control region capable of eliciting 
host c^ll-type restricted, integration site independent, 
copy number dependent expression of the said gene. 

2. A vector according to claim 1, wherein the dominant 
control region is derived from the mouse A immunoglobulin 
gene locus. 

15 

3. A vector according to claim 2 wherein the dominant 
control region comprises a DNasel super hypersensitive site 
which is about 2.35kb upstream of the CAP site of the 
rearranged A 1 gene. 

20 

4. A vector according to claim 2 wherein the dominant 
control region comprises a DNasel super hypersensitive site 
which is about 2.5kb upstream of the genomic VA 2 segment. 

25 5. A vector according to claim 2 wherein the dominant 
control region comprises a DNasel super hypersensitive site 
which is about 30kb downstream of the -rearranged A 1 gene. 

6. A vector according to claim 2 wherein the dominant 
30 control region comprises a super hypersensitive site which 

is about 17kb downstream of the CA 4 gene segment. 

7. A vector according to claim 2 wherein the dominant 
control region comprises in combination a DNase I super 

35 hypersensitive site which is about 2.35 kb upstream of the 
CAP site of the rearranged A 1 gene and a DNase I 
hypersensitive site which is about 30kb 3' of the rearranged 
A. gene. 
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8. A mammalian host cell transformed with a vector 
according to any one of the preceding claims . 

5 9. A transgenic mammal transformed with a vector according 
to any one of claims 1 to 7. 

10. A method of producing a polypeptide comprising 
culturihg a mammalian host cell transformed with a vector 

10 according to any one of claims 1 to 7. 

11. A method of gene therapy comprising the steps of 

i) removing stem cells from the body of a mammal , 
15 ii) killing stem cells remaining in the body, 

iii) transforming the removed stetn cells with a 
vector as defined in any one of claims 1 to 7 
containing a gene deficient or absent in the 
body, and 



20 



iv) replacing the transformed stem cells in the 
body. 

12. A method of gene therapy comprising the steps of 



25 



i) removing stem cells from the body of a mammal, 

ii) transforming the removed stem .cells with a 
vector as defined in any one of claims 1 to 7 
containing a gene deficient or absent in the 

30 body, and 

iii) replacing the transformed stem cells in the 
body. 

13. A functional enhancer element derived from the mouse 
35 A immunoglobulin locus. 

14. A functional mouse immunoglobulin A 1 enhancer 
consisting of a DNA sequence comprising all or a functional 
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part of the DNA sequence between the EcoRl site 3.8 kb 
downstream of the Xho, I site in the rearranged mouse A 1 gene 
and the SnaBl site 10 kb downstream of this Xho I site. 

5 15. A functional mouse immunoglobulin A 1 enhancer 
according to claim 14 comprising all or a functional part of 
the 1.3 kb first Hindlll to Hindlll DNA fragment downstream 
of the EcoRI site 3.8 kb downstream of the Xho I site in the 
rearranged mouse A 1 gene. 

10 

16. .A functional mouse immunoglobulin \ y enhancer 
according to claim 15, wherein the said 1.3 kb Hindlll to 
Hindlll fragment downstream of the EcoRI site 3.8 kb 
downstream of the Xho I site in the rearranged mouse A 1 gene 

15 is that labelled "HIIIA", in Figure 11. 

17 . A functional mouse immunoglobulin A, enhancer 
according to claim 14 comprising all or a functional part of 
the 3 . 3 kb Hindlll to Hindlll DNA fragment downstream of the 

20 EcoRI site 3.8 kb downstream of the Xho I site in the 
rearranged mouse A 1 gene and spanning the SnaBI site 10 kb 
downstream of this Xho I site. 

18. A functional mouse immunoglobulin Aj enhancer 
25 according to claim 17, wherein the said 3.3 kb Hindlll to 

Hindlll fragment downstream of the EcoRI site 3.8 kb 
downstream of the Xho I site in the rearranged mouse A, gene 
and spanning the SnaBI site 10 kb downstream of this Xho I 
site that is labelled "HIIIC" in Figure 11. 

30 

19. A functional enhancer element according to claim 13 
derived from the region about 30kb 3' of the rearranged 
mouse A 1 immunoglobulin gene. 

35 20. A functional enhancer element comprising a functional 
enhancer element according to claim 13 in combination with 
a functional enhancer element according to claim 19. 
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21. A vector comprising a functional mouse immunoglobulin 
A, enhancer according to any one of claims 13 to 20. 

22. A vector according to claim 21 suitable for 
5 integration consisting of an isolated DNA sequence 

comprising a functional mouse immunoglobulin A 1 enhancer 
according to any one of claims 13 to 20 and an independent 
gene expression system , provided that when the promoter of 
the independent expression system gene is the mouse A 1 
10 immunoglobulin promoter, the gene does not encode mouse A t 
immunoglobulin. 

23 ^ A vector according to claim 21 consisting of an DNA 
sequence comprising a replicon capable of ensuring 
replication of the vector in a host cell, a functional mouse 
immunoglobulin A enhancer according to any one of claims 13 
to 20 and an independent gene expression system, provided 
that when the promoter of the independent gene expression 
system is the mouse A, immunoglobulin promoter, the gene 
does not encode mouse A 1 immunoglobulin. 

24. A transfer vector comprising an isolated functional 
mouse immunoglobulin A 1 enhancer selected from: 

25 i) the sequence between the EcoRI site 3 .8 kb downstream 
of the Xho I site in the rearranged mouse A 1 gene and the 
SnaBI site 10 kb downstream of this Xho I site, 

ii) all or a functional part of the 1.3 kb first Hindlll 
30 to Hindlll DNA fragment downstream of the EcoRI site 3.8 kb 

downstream of the Xho I site in the rearranged mouse A t 
gene, 

iii) all or a functional part of the 3.3 kb Hindlll to 
35 Hindlll DNA fragment downstream of the EcoRI site 3.8 kb 

downstream of the Xho I site in the rearranged mouse A 1 gene 
and spanning the SnaBI site 10 kb downstream of this Xho I 
site. 
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25. A human or animal host cell transformed with a vector 
according to any one of claims 21 to 24. 

26. A method of producing a polypeptide comprising 
culturing a human or animal host cell transformed with a 
vector according to any one of claims 21 to 24. 
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